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The uncertainty principle, which states that certain sets of quantum-mechanical measurements 
have a minimal joint uncertainty, has many applications in quantum cryptography. But in such 
applications, it is important to consider the effect of a (sometimes adversarially controlled) memory 
that can be correlated with the system being measured: The information retained by such a memory 
can in fact diminish the uncertainty of measurements. Uncertainty conditioned on a memory was 
considered in the past by Berta et al. [T], who found a specific uncertainty relation in terms of 
the von Neumann conditional entropy. But this entropy is not the only measure that can be used 
to quantify conditional uncertainty. In the spirit of recent work by several groups nn, here we 
develop a general operational framework that formalizes the concept of conditional uncertainty in 
a measure-independent form. Our formalism is built around a mathematical relation that we call 
conditional majorization. We define and characterize conditional majorization, and use it to develop 
tools for the construction of measures of the conditional uncertainty of individual measurements, 
and also of the joint conditional uncertainty of sets of measurements. We demonstrate the use 
of this framework by deriving measure-independent conditional uncertainty relations of two types: 
(1) A lower bound on the minimal joint uncertainty that two remote parties (Bob and Eve) have 
about the outcome of a given pair of measurements performed by a third remote party (Alice), 
conditioned on arbitrary measurements that Bob and Eve make on their own systems. This lower 
bound is independent of the initial state shared by the three parties; (2) An initial state-dependent 
lower bound on the minimal joint uncertainty that Bob has about Alice’s pair of measurements in 
a bipartite setting, conditioned on Bob’s quantum system. 

PACS numbers: 


I. INTRODUCTION 

The discovery of quantum mechanics in the early twentieth century brought about a profound change in the way we 
view the physical world. The theory contains many counter-intuitive features for which we are still struggling to find 
satisfying interpretations. Nevertheless, it is remarkably successful in its agreement with observation, and in fact, its 
counter-intuitive features have found many practical applications. Among these features is the uncertainty principle, 
which states that there is a “minimum uncertainty” inherent in certain pairs of measurements [3- If the outcome 
of one of the measurements can be predicted with high precision, then the other measurement necessarily becomes 
highly imprecise. This principle has applications in cryptographic tasks [HHin], the study of quantum correlations and 
nonlocality mm, and continuous-variable quantum information processing [IMi]- 

An extensive amount of work has been done on the quantum uncertainty principle in connection with an isolated 
system [IMTT] . But if the quantum system in consideration is correlated with another system, this second system 
can act as a “memory”. The initial (pre-measurement) correlations can cause the memory to retain some information 
about the system after it has been measured, effectively “alleviating” the constraints imposed by the usual uncertainty 
principle. Berta et al. [1] were the first to consider this situation, and formulated a modified version of an entropic 
uncertainty relation that applies in the presence of a correlated memory: If is a bipartite quantum state shared 
by two parties Alice and Bob, and and are classical-quantum states that result, respectively, from Alice 
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measuring her subsystem A for the observables X and Z, then 

+ H{Z\B)r > log 2 - + H{A\B)p, (1) 

c 

where is the conditional von Neumann entropy, and c is a constant determined by the incompatibility of the 

observables X and Z. The interpretation of this identity is as follows: The first term on the left represents the 
uncertainty about Alice’s X measurement, given access to Bob’s system B. Similarly, the second term measures the 
uncertainty about Alice’s Z, given access to B. The last term on the right, H{A\B), can be negative, e.g. when 
is maximally entangled, mitigating—and sometimes completely eliminating—the effect of the log(l/c) term. This 
means that if Alice and Bob get along well enough, they can defeat the uncertainty constraint that otherwise plagues 
all incompatible quantum measurements! This result was subsequently experimentally demonstrated in |28j . 

Berta et aZ.’s uncertainty relation uses the von Neumann conditional entropy to measure the uncertainty of a 
variable conditioned on a memory (we shall call it “conditional uncertainty”), and the sum of individual conditional 
entropies to measure the joint conditional uncertainty of X and Z. But this measure is only one of infinitely many 
ways to measure joint conditional uncertainty, and does not capture its complete operational meaning. This is similar 
to how specific entanglement measures, such as the entanglement cost and relative entropy of entanglement, don’t 
completely capture the meaning of entanglement. On the other hand, the operational framework of local operations 
and classical communication (LOCC)—operations that do not create entanglement—transcends specific measures and 
completely characterizes entanglement. In [2HB], uncertainty without a memory (or “non-conditional uncertainty”) 
was characterized through a class of operations acting on probability distributions of a classical variable X. These 
“certainty-nonincreasing” operations are random relabelings of X, which result in doubly stochastic maps acting on 
. Given two distributions p^ and , the proper way of comparing their uncertainties is not through any entropy 
or other such measure, but rather through the relation of majorization |31| . p^ can be said to be more certain than 
only if the former majorizes the latter, denoted 


The quantum uncertainty principle was then derived by applying this operational framework on pairs (or larger sets) 
of distributions that result from quantum-mechanical measurements applied on quantum states. This majorization- 
based approach also enabled the construction of so-called universal uncertainty relations. These are relations of the 
form 


v(p^,q^) 


( 2 ) 


where p^ and are the probability distributions over the outcomes of measurements X and Z, respectively, applied 
on an arbitrary quantum state p. v is some vector (not necessarily in the same space as p and q) constructed out of 
p and q. a; is some constant vector, independent of p. The universality of such a relation is in the fact that for any 
measure lA of uncertainty, one can get a lower bound on the joint uncertainty (measured in terms of Z^(v)) of (p^, q'^) 
by evaluating lA on the fixed argument u). One type of universal relation, reported in Hi, uses the construction 


v(p^,q^) = p^(g)q^, 

while another type, derived in [m , uses 

v(p^,q^) = p^©q^. 


( 3 ) 

( 4 ) 


These “direct product”- and “direct sum”-type relations yield many known entropic uncertainty relations, via the 
application of appropriate entropy functions on the two sides of the majorization relation. 

In this paper, we will approach the conditional quantum uncertainty principle using a combination of the ideas of 
[D and HlSIi]: Incorporating a correlated memory and defining conditional uncertainty in a measure-independent 
way. We start by setting up an operational framework for conditional uncertainty, in Section [ll] We will dehne a 
mathematical relation, which we call conditional majorization, that acts as the basis for comparing the conditional 
uncertainties of variables, analogous to how ordinary majorization enables the comparison of non-conditional uncer¬ 
tainties. If and are two classical-quantum states that carry distributions of a classical variable X correlated 
with a quantum system B, then the relation conditionally majorizes in our formalism, denoted 


XB 




XB 


will signify that the variable X is more certain in the former state than in the latter, conditioned upon access to the 
quantum system B. 
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We will derive several results about conditional majorization, including a thorough characterization of the case of 
a classical memory (Section |II A[ ). Note that the case of a classical memory, i.e., classical correlations between X and 
-B, is still non-trivially different from the non-conditional case (from the viewpoint of uncertainty). Moreover, the 
classical-memory case helps in constructing measures of conditional uncertainty that apply to the quantum case. For 
some special cases, we will be able to further simplify the characterization, condensing the conditions to a finite number 
of inequalities. In the case with a quantum memory, we will derive a sufficient condition for conditional majorization 
(Section |ll B[ ). We will also develop a toolkit for constructing measures of conditional uncertainty (Section |ll C[ ). This 
will lead us to the investigation of measures of the joint conditional uncertainty of two or more states (Section |II D I. 
The concept of joint uncertainty is central to the quantum uncertainty principle, which is really about more than one 
measurement considered together. 

In Section m we will apply our calculus of conditional majorization to a typical cryptographic scenario, also 
considered in [siin]. Here, two legitimate parties, Alice and Bob, are engaged in an LOCC protocol (e.g., quantum 
key distribution), and must contend with an “eavesdropper”. Eve, who potentially has access to a purihcation of Alice 
and Bob’s initial state. Thus, the overall state shared by the three parties is of the form In [1], a bound 

was derived on the amount of information (measured in terms of the von Neumann conditional entropy) about Alice’s 
system that can “leak” to Eve, based on how much initial correlation Alice and Bob share: 

H{X\B), + H{Z\E)r > log2 -. (5) 

c 

This identity is, remarkably, equivalent to Q. In this latter form, the first term still quantihes the uncertainty about 
Alice’s X measurement conditioned on access to Bob’s system B. But the second term now represents the uncertainty 
about Alice’s Z measurement given access to the purification of Alice and Bob’s composite system—in other words, 
the uncertainty that Eve has about Alice’s Z\ While the form Q tells us that Alice’s measurement outcome can 
become completely certain, even for incompatible measurements, given complete access to a purification of her system, 
([^ shows that access to disjoint parts of the purification based on her different measurement choices—which is what 
Bob and Eve get individually, unless they both collaborate—can never completely lift the joint uncertainty of two 
incompatible measurements. This fact is related to the monogamy of quantum correlations: If Bob has access to a 
part of the environment that is highly correlated with Alice’s system, then the rest of the environment, visible to 
Eve, is necessarily very weakly correlated with Alice’s system. The extreme case of this is when Alice and Bob share 
maximal entanglement—then Eve gets no information at all! 

In Section |III| we will derive a similar result, but not involving any specihc measure of conditional uncertainty. 
Instead, our “universal conditional uncertainty relation” will state that, if Bob and Eve (respectively) perform mea¬ 
surements {Mf,} and {Mg} on their own systems when Alice (respectively) measures {Ma^j and the joint 

distributions of the variables (ai,&) in the hrst case, and ( 02 , e) in the second, obey 

(g) ^<3 fl. (6) 

This is similar to the universal uncertainty relation ([^, except that here we have a minimum joint conditional 
uncertainty that can be generated from any conditional uncertainty measured simply by evaluating Z//(a;i, ^ 2 ). Just 
by applying different measures U to this single identity, one could churn out different results similar to the one in [T] . 
Note, however, that the measures of joint conditional uncertainty generated this way are all of a particular special 
type: They quantify the uncertainty that Bob and Eve suffer when they try to learn about Alice’s measurement by 
also subjecting their own systems to measurements. In this sense, these measures of uncertainty are more in the spirit 
of the accessible information than the von Neumann conditional entropy. 

In Sect ion [Tv| on the other hand, we will use our results on quantum-conditioned majorization to derive a completely 
quantum universal conditional uncertainty relation. This relation will apply to a bipartite setting, where Alice and 
Bob initially share a state , and then Alice makes either measurement Mx or My (Bob makes no measurement). 
The relation we will prove is of the form 


(7) 

where and are the post-measurement states for the two choices of Alice’s measurement. This result demon¬ 
strates the power and versatility of the operational conditional uncertainty framework. We will also note that it can 
easily be adapted to a tripartite universal uncertainty relation, similarly to how ([^ is derived from Q. 

Finally, in our conclusive section, we will discuss several other possible operational scenarios that can be subjected 
to our methods, leaving their study for future work. 
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FIG. 1: The operational schematic for conditional uncertainty: Alice and Bob initially share a quantum-qnantum state 
that could contain quantum correlations. Alice’s measurement Mx turns her subsystem A into a classical register X; the 
correlations become classical, too. She then carries out random relabeling operations on X, possibly conditioned on classical 
outputs obtained by Bob. 


II. CONDITIONAL MAJORIZATION: AN OPERATIONAL FRAMEWORK 


A schematic of the operational set-up for conditional uncertainty is shown in Fig. Alice holds the quantum 
system of interest, A; Bob holds another system, B, which is the memory. The initial state shared by Alice and Bob, 
[32j . could contain classical or quantum correlations, even entanglement. But Alice will eventually perform some 
measurement on A. Let Mx = {Mx}xG{i...n} be the POVM associated with Alice’s potential measurement. The act 
of measurement maps her quantum system A to an n-dimensional classical system, X, that carries the measurement 
outcome. Alice and Bob’s post-measurement state is a classical-quantum (CQ) state, i.e., a state with the form 

n n 

^XB ^ ^ ^ ='^Px |a;) {xf ® erf, (8) 

x—1 x—1 

where {|x)}a; is an orthonormal basis consisting of “classical outcome flags” on the classical register X. We define 
Px = Tr and = p~^ Tr^ [(Mf (g) l^)p^^]. 

The conditional uncertainty is really a property of this post-measurement CQ state, and so such states are the 
objects in our formalism. In the remainder, we denote by CQ" the set of all CQ states with classical dimension 
|A| = n. Similarly, we will denote by CC" the set of all classical-classical (CC) states with the first subsystem n- 
dimensional. We will now identify the class of operations that cannot increase conditional certainty. Alice can perform 
classical certainty-nonincreasing operations, i.e., random relabelings, on X. Bob, on the other hand, is allowed to 
perform arbitrary quantum operations on B. Moreover, Bob’s action can in particular produce a classical output, and 
Alice’s action can depend on his the output. This results in the following general form for the allowed operations. 

Definition 1. For a state G CQ"^, a quantum-conditioned random relabeling operation (QCR) is a completely 
positive (CP) trace-preserving (TP) map of the form 


^XB ^ ^XB' _ 






x,x' — l j 


(o-x) 


1 B' 


(9) 


where j can run over an arbitrary number of values, each of whose corresponding is an n x n doubly stochastic 
matrix and £b') ^ trace-nonincreasing CP map, such that the map 

j 

is TP. Note that is also in CQ". For a pair of CQ states (cr^®, t^^') that satisfy (j^ for some QCR M, we will 
say that quantum-conditionally majorizes (or simply conditionally majorizes) , denoted 

^XB , XB' 
a >-cT 
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(or, equivalently, cr^^)- 

It is worth mentioning that the “quantum” element in the relation comes only from the memory upon which the 
(un)certainty of X is conditioned; to stress this, we use “quantum” (or, when appropriate, “classical”) as a sort of 
adverbial qualifier on the adjective “conditional”. Note that {QCR} C {LOCC}. Note also that, without loss of 
generality, we can assume that each has only one Kraus operator. Otherwise, if then 

we can define for all £. In order to characterize conditional certainty, we must find the conditions under 

which for a given pair ) of initial and final states. 


A. Classical-conditional majorization 


Let us first consider a classical memory B = Y, so that the states of XV are in CC", i.e., just joint probability 
distributions In this case we will dispense with the (here) cumbersome bra-ket notation, and instead represent 

the state using the matrix P whose components are Pxy We use the notation py = p^y for the marginal probability 
of F = y, and = Py^ iPxy)^ conditional distribution of X given Y = y. In this case, since Y 

is classical, we replace the £^^'>’s of ([^ with classical trace-nonincreasing operations (i.e., sub-stochastic maps). This 
results in transformations of the following form. 

Definition 2. For a classical register X and a classical memory Y initially in some joint distribution P G CC", a 
classical-conditioned random relabeling (CCR) is a stochastic map of the form 


P ^ Q = PR^^\ 

3 


( 10 ) 


where j can run over an arbitrary number of values, each of whose corresponding is an n x n doubly stochastic 
matrix and a matrix with non-negative entries, such that R^^'^ is row-stochastic. 

Remark (1). Any CC-to-CC state transformation P i-G Q that can be achieved via QCR can also be achieved by some 
CCR. For, let Q = Af{P) for some QCR Af. Then, merely follow Af with a rank-1 projection map in the classical 
basis of Q; this overall amounts to a CCR, while still achieving the transformation P i-G Q. In light of this fact, we 
will use the same symbol ‘Vc” to denote CC-to-CC convertibility via CCR. 

Remark (2). Arbitrarily reordering the elements of each coulmn of P is a CCR: Just choose = Ilj (permutations), 
and R^^'^ = ejj, i.e., a matrix with zeros everywhere except the {j,j) element, which is chosen to be 1. 

The number of rows of P is just the n that was defined before; this equals the number of rows in Q, because the 
effective transformation acting on X is always doubly stochastic. However, the arbitrary classical channels allowed 
on Y can change the number of columns in P to a different one in Q, transforming F to a different classical system 
altogether, which we call Z. We use £ (respectively, m) to denote the dimensions of F (Z). 

In the following, we will prove several results that provide a characterization of classical-conditional majorization, 
as well as offering insight into its nature. Providing most of the technical proofs in the appendix, here we limit to a 
high-level discussion of the essence of the results. 

We say that P and Q are conditionally equivalent and write P ~c Q, if P Pc Q and P Q- Identifying cases of 
such equivalence leads to further simplification, resulting in the following standard form: 

Definition 3 (Standard Form). Let P = [pxy] be an n x Z joint distribution matrix. Its standard form P'^ = [p'^xu] 
an n X P matrix, where £! < 1. To obtain P'^, we apply the following transformations on P: 


1. Reordering within columns: Arrange the elements of each column of P in non-increasing order. Since this is a 
reversible CCR, we can just assume WLOG that P has this form. That is, for all y = 1, P\y > Piy • • • > Pny 


2. Combining proportional columns: If two columns of P, say those corresponding to y and yt are proportional 
to each other (i.e., pi*' = pi** ), then we replace both by a single column {py J-PyOpI**. We do this until no two 
columns are proportional. The resulting matrix P contains £' < £ columns, each of the form pujpl'". 

3. Reordering the columns: Reorder the £' columns of P in non-increasing order of Pw If there are ties, resolve 
them by ranking the tied columns in non-increasing order of their first component (pi^)- If there remain ties, 
we rank by non-increasing {piw -\-p 2 w), and so on. 


The resulting n x f matrix is the final standard form p£. Lemma A.2 in the appendix shows that P'^ ~c P- 
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Hereafter, we will (often implicitly) assume all states to be in their standard form, without loss of generality. This 
enables an elegant characterization of conditional majorization through the n x n lower-triangular matrix 


/I 0 0 ••• 0\ 

1 1 0 ••• 0 

1 1 1 ••• 0 

Vl 1 1 ••• 1/ 


( 11 ) 


Lemma 1. For P and Q in the standard form, P >-c Q if cLnd only if there exists a row-stochastic matrix T such that 

LPT > LQ, (12) 

where the inequality is entrywise. 

The predicate of the existence of such a T is really the question of feasibility of the linear programming problem 
defined by the condition (12). Since the number of free parameters in T, and the specification of the linear program, 


are all linear in the sizes of the instances of P and Q, one can numerically decide the predicate efficiently using standard 
linear programming algorithms. However, the existential clause renders the condition rather opaque to intuition and 
qualitative understanding. In the following, we will try to overcome this obstacle. Firstly, the standard form, together 
with the simplihcation afforded by Lemma leads to some nice properties of the conditional majorization relation: 

Theorem 2. Let P, Q, R be three probability matrices. Then, 


Reflexivity: P >~c P. 

Transitivity: P >-c Q and Q >-c R P >-cR. 

Antisymmetry: P )~cQ and Q >-c P => P^' = Q^'. 

(13) 


That is, is a partial order with respect to the standard form. 

The long, but straightforward, proof is provided in the appendix. 

There is a hnite set of inequalities that prove sufficient as conditions for ordinary majorization between a given pair 
of vectors; these are in terms of the partial sums of the vector components m- In conditional majorization, it turns 
out to be difficult to hnd a finite sufficient set of conditions that are as simple to understand as those of majorization. 
While any instance of conditional majorization is efficiently solvable by linear programming algorithms, we would like 
conditions in some form that provide more insight. For example, conditions of the following form: 

P^cQ^HP) < HQ). 


To this end, our next, main result is that a restricted collection of such $-like measures together form a sufficient 
condition: 

Theorem 3. Let P and Q be n x I and n x m joint probability matrices. Denote by the set of all n x m 

row-stochastic matrices. Then, the following conditions are equivalent: 

1. QFcP. 

2. For all matrices A= [ai, ...,am] S 


t m 

^Py$A(p'^) > g™$A(q'“), (14) 

y—1 w—1 

where is a sub-linear functional given by 

$a(p'^) = max (a|;)'^ (pi*') . (15) 

3. For all convex symmetric functions 

^ 771 

^p,<i>(pl*')>^g^$(ql-). 

y—1 w—1 


( 16 ) 
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Remark. The matrix A can be assumed to be in standard form. Furthermore, by taking $ in (161 to be Shannon 
entropy, the left hand side of (16) becomes H{X\Y). 


The above theorem can be proved using standard methods from the field of linear programming, particularly Farkas 
Lemma. The details are provided in the appendix. 

We now look at a few special cases of the matrix dimensions n, I and m, wherein conditional majorization is 
relatively easier to verify: 


1. n = 1, with ^ and m arbitrary: In this case always holds, and therefore P ^cQ for any {P,Q). 


2. m = 1, with n and i arbitrary: Here Q = q is a one-column matrix equivalent to a probability vector q. Moreover, 
T in Lemma[^also have only one column, and since it is row-stochastic, T = e = (1,..., 1)^. Denoting p = Pe, 
we have 


P^cQ 


p ^ q 


(17) 


3. £ = 1, with n and m arbitrary: In this case P = p is a probability vector, and T = (ti, ...,tm) is a probability 
row vector. We therefore get 

P>cQ p ql'" V ru = 1,..., m. (18) 


4. £ = 2, with n and m arbitrary: In this case, we have the following theorem, proved in the appendix. 

Theorem 4. Let P and Q be n x 2 and n x m probability matrices given in their standard form. Define 

k k 

= {Px\i - Px\ 2 ) and ^ - p^\ 2 ) ■ (19) 

X—1 X—1 


Denote byl'^, , andX the set of indices {A:} for which p,k is positive, zero, and negative, respectively. Furthermore, 
define p = pi = J^xP^h s® ^^at 1 — p = P 2 . Also define: 


Qw 1 

f 1 

0, max - , 

i ; j3n, = — min -1 


a-w = — max < 

, 1, mm — J 

p 1 

kei+ pk J 

I P 1 

k^X~ /Xfc J 


and through these. 


W+{P,Q) = l-Y,a^-, 

W—1 


Wo{P,Q) = - max 

w\k£X^ 


w.{p,q)=(J 2 


- I; 


IFi(P, Q) = min (/3„ - a^u). 

w£{l,...,Tn} 


( 20 ) 


( 21 ) 

( 22 ) 


Then, Q <c P if and only ifWo, IFi, W+, and W- are all non-negative. 

The conditions in the theorem above can be simplified in special cases. The following special case will be helpful 
in constructing “universal uncertainty bounds” similar to those in [5]. 

Corollary 5. If pl^ ^ pl^ and also ql™ ^ pl^ for all w = then Q <c P if and only if W+{P,Q) > 0. 

Furthermore, W.^-{P,Q) > 0 if and only if 


P > / maxlO, where = max 

hczT+ 


,(t") 


W—1 




(23) 


Moreover, if pl^ = ei = (1, 0,..., 0)^, then 


/lu, = max ■ 

k 


1 - TTfc 


, with TTfc = y^Px| 2 - 


(24) 


x—1 


Proof. If pl^ ^ pl^ then I = 0. If in addition qfo ^ pl^ then ■ fo this case, we always have Hb > 0 

and Wi > 0. Then, from Theorem [dj Q P and only if W+{P,Q) > 0. Eq. (23) and (24) follow from direct 

_ I—I ' 

calculations, with ^x^iPx\i = 1- Q 










Using this corollary, for any given arbitrary n-dimensional distribution uj, we can construct a “minimal P” (that 
we call U) that conditionally majorizes a given Q, with the requirement that cj be one of the conditionals of this U: 

Corollary 6. -For an arbitrary n-dimensional distribution oj, define 

W^ = {w:q}^ 


If 

qw>I-a, 


then Q >~c Q, where 


n = 


a 0 

(1 — Ot)uJi (1 — Ot)u}2 


0 


n T 


(1 - a)Un 

This last corollary will be applied directly in SectiorjITT] to obtain a universal uncertainty relation. 


(25) 


(26) 


B. Quantum-conditional majorization 


In the general case, where the memory B can be quantum, conditional majorization becomes rather complex. In 
particular, it could be hard to find simple conditions that are both necessary and sufficient for conditional majorization. 
Nevertheless, we can find some sufficient conditions that prove useful in building quantum uncertainty relations. Let 
^XB ^ (^1^ 0 be a classical-quantum state where {(J^} ^ normalized states acting on Bob’s 

Hilbert space with dimH^ = d. Each can be written as: 

d— 1 d— 1 

^ ^y\x\y T {y 1 ^ ^ Qy\x\'^y\x) ('^y|a:| 

y^o y^O 


where {\x;y)}y^l are the eigenvectors of that corresponds to the eigenvalues \y\x- The set of vectors {\uy\x)^}y^ 
generate another possible decomposition of Ux- In general, the set {\uy\x)^}yZ^ is not orthonormal and in fact, in 
special cases it is even possible to have \uy\x) = \uy'\x) for y yf y'. 

Lemma 7. Let be a density matrix and denote by = (Ao|a;,..., A„_i| 2 :) its vector of eigenvalues. Then, there 
exists states {\uy\x)}yZl such that erf = Y.ti=o%\ooWy\oo)^{'^y\x\ if and only if 

ql"’ ^ a'"’, (28) 

where = {qo\x, ■■■,qn-i\x)- 

The uncertainty principle is about there being a “minimum possible uncertainty” in a state. Therefore, (conditional) 
uncertainty relations involve finding a lower bound to the (conditional) uncertainty of a measurement outcome. Here 
we are interested in finding a “state-valued” lower bound that can apply in general, without regard to what measure 
of conditional uncertainty is used. In other words, we are interested in constructing some G CQ" such that 

We would like this Q to be nontrivial, yet simple enough that it affords some insight. For this reason, 
we take as Ansatz the following: 

= c.|0)(0|^ 0 |0)(0|® + (1 - a;)|l)(l|^ 0 |^)(^|® (29) 

where |'0)^ is some fixed state in Bob’s Hilbert space, We also completes |0)® (i.e. the state appears in fl^^) to 
an orthonormal basis which we denote by {\y)^}yZo for ah 2 / = 0,..., d — 1. In the theorem below assume that d = n, 
since if d < n we can always embed TL^ in C", and if n < d we can always add terms to with zero probabilities. 
We will therefore use the same index n for both d and n. 

Theorem 8. Let and be CQ states as above, and for each ax let {qy\x, \uy\x)}'i^ZQ one of its many 

decompositions. Denote by Cy = (yltp) the y component of \ijj) when written in the basis {|y)®} discussed above. 
Finally, for any j, k G {0,..., n — 1}, with j yf k, denote by 


r^y'^'^ = qy\k\{Uy\j\Uy\k) 


-LA) = 


— ^ ' di/|fc I ('at/1,7 I ’ 


.* = j.U,k) 

jk — 




y=o 


{y\ 


mm 

Ai.k) 


> 0 } 


%\j 

J3,k) 


(30) 
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Then, -<c if oj < and there exists j,k G {0, ...,n — 1}, with j k, such that the following condition 

hold: 

W = l^jfe(w)) = ^ (^'/okT^lO)^ + y/l-uj- rd’f^lqk |1)^) (31) 

Corollary 9. Using notations as in Theorem^ if all aff = \4>x)^{4>x\ then for any j € {0,..., n — 1} and uj € [0, qj] 

c.|0)(0|^®|0)(0|^ + (l-a;)|l)(l|^®|vl/,(a;))(vl/,(c.)|^ (32) 

where 

(c, |0)^ + |1)^) (33) 

and 


Cj = max|((/)j|())fc)|y^ 


(34) 


C. Measures of conditional uncertainty 


Just like Schur-convex functions provide a way of understanding ordinary majorization, we can understand the 
structure of conditional majorization through similar functions: 

Definition 4. A measure of the uncertainty of X conditioned on B is a function Ux\b ■ CQ^ —)■ M, with the following 
properties: 

1. Ux\Bm (or®|o) (0|®)=0. 

2. Ux\b{(t^^) < Ux\B' whenever r^^'- 

Condition above, for any conditional uncertainty measure Ux\Bj is a necessary condition for . 

But since the conditional majorization relation is not a total order, it is not completely characterized by any single 
measure. 

How do we construct such measures? One way is to first consider the classical-conditional case: 

Definition 5. A measure of the classical-conditioned uncertainty of X given H is a function Ux\y '■ CC" —>■ K, with 
the following properties: 

1. Ux\y{P) = 0 if pxy = Pydxy (or equivalently px;\y = 5xy)- 

2. Ux\y{P) < i^x\Y (Q) whenever P Q. 

Now, define the following: 

Definition 6. Let € CQ’^. For a classical-conditioned uncertainty measure (as in Definition IdxjY, the 
minimum U -classical uncertainty of X given B is a function A4u : CQT —>■ K defined by 

Mu := minfJxiv [P) , (35) 

' ' A /f,. ' 


where the minimization is over all POVMs My describing possible measurements on B, where P G CC" results from 
through a particular choice of this measurement. 


Lemma 10. It is sufficient to take the minimization in (35) over all rank 1 POVM measurements. 


The proof will be provided in Appendix [C| Lemma C.l of the appendix establishes that such a measure is non¬ 
decreasing under QCR, thereby justifying its nomenclature. Indeed, from Theoremj^ it follows that for any symmetric 
concave function $, the function 


(P) = (pl^) 

y 


( 36 ) 
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is a measure of uncertainty of X\Y. For this choice of U the minimal conditional uncertainty is given by 

M<s> = mmU,p (P). (37) 

In the case that is the Shannon entropy H, 

M.^=h = miniJ(X|F) 

^ ' My 

= H{X) - maxH{X : Y) > S{X\B)„, (38) 


where the last inequality follows from the Holevo bound on the accessible information, maxM^ H(X :Y). 

A useful method for generating conditional uncertainty measures is using contractive distance measures (or even 
pseudo-distance measures such as the relative entropy) on the space of density matrices. Let A(-, •) be a measure that 
is contractive under quantum operations, i.e., A{£{pi),£{p 2 )) < A(pi,p 2 ) for any two density matrices (pi,P 2 ) and 
any quantum channel £. Now consider the following class of states: 

This is the maximal subset of CQ” with the property of closure under the set of QCR operations. That is, the property 
that y£ G {QCR}, £(P’‘) C J"". This is the set of states that can be considered the most conditionally uncertain. 
Now define 7 /a : CQ" —M given by 


Uf^(a^^)=c— inf A V (39) 

with c a suitable constant. Then, by virtue of the contractivity of A, 7 /a is a measure of conditional uncertainty of 
X given B. Many important entropic “distances”, such as the relative entropy and the Renyi divergences for certain 
parametric regimes, can be used in place of A to yield such measures. 


D. Joint conditional nncertainty 

Since the uncertainty principle is about more than one potential (actual or counterfactual) measurement on Alice’s 
side, we require a notion not of the conditional uncertainty of a single measurement, but of the joint conditional 
uncertainty of two or more measurements. Here we will consider the case of two measurements, but the same 
methods can be naturally adapted to more than two. 

We would like first to define the most general notion of joint conditional uncertainty that is independent on whether 
the two measurements on Alice’s system are actual or counterfactual, and is not restricted to a particular scenario or 
task. We therefore only require from such a measure to be monotonic under joint QCR: 

Definition 7. Let J : CQ^ x CQ™' -G M, and let oxb € CQ” and ^zc € CQ™. Then, J' is a measure of the joint 
conditional uncertainty if: 

(1) J {axB.lzc) = 0 if both axB and 'jzc are product states of the form |0)(0| ® p. 

( 2 ) J {axB.lzc) > J (d'xBGJzc') if both axB ^xb' and "fzc -<c Izc- 

Similarly to the minimum classical conditional uncertainty measures, one can construct measures of joint conditional 
uncertainty from classical measures. This can be done as follows. Let : CC" x CC™ — )■ K be a measure of classical 
joint uncertainty (i.e. satisfying the conditions of Def. [^for classical states/distributions P and Q). Then, 

Theorem 11. Let uxb G CQ^ and "fzc € CQ™ be two classical-quantum states; let My and M\y be POVMs 
acting, respectively, on B and C; and let P = (pxy) o,nd Q = {qwz) be two probability matrices with probabilities 
Pxy = Tr WxB (|3:)(a;| 0 My)] and q^w = Tr [yzc (k)(2| ® M^u)]. Then, the function 

J {(JXB,1ZC) ■■= min J‘^\P,Q) (40) 

My iMz 

is a measure of joint conditional uncertainty. 
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The basic definition of a joint conditional uncertainty measure is independent of any operational task, game, 
or scenario that involves some facet of the conditional uncertainty principle. But specific tasks often admit the 
construction of useful measures that naturally reflect the structure of those tasks. For example, consider the scenario 
in Fig.[^ Here Alice and Bob share two copies of the same state, ^ On the first copy, Alice performs the 

measurement Mx', on the other copy, she performs Mz- Effectively, the two measurements Mx and Mz have been 
combined into one joint measurement on a tensor-product system. Then, the conditional uncertainty of XZ given 
B 1 B 2 in the resulting CCQQ state is actually a manifestation of the joint conditional uncertainty of X\B and Z\B-, 
therefore, any measure of conditional uncertainty of this state is effectively a measure of the joint uncertainty of the 
two measurements conditional on H. In Section IV we will find a universal conditional uncertainty relation based on 
this scenario. 


Bob Alice 




My 


X I 



FIG. 2: Alice and Bob share two copies of the same state. Different measurements by Alice lead to a joint conditional uncertainty 
that manifests as the conditional uncertainty of the global state. 


One can make variations on the above scenario, depending on the particular task. For example, Bob could perform 
measurements on his systems. In another scenario (Fig. |^, Alice and Bob actually share only one copy of the state. 
But now a shared random bit r determines the measurements that they each perform. Here again, Alice’s two 
measurements have been combined into one; but this time, instead of both measurements happening on a different 
copy of the same state, one randomly-chosen measurement is performed. Any measure of conditional uncertainty 
of this randomized hybrid measurement is then a measure of joint conditional uncertainty of the two individual 
measurements. 

A particular scenario that is relevant in cryptographic tasks is depicted in Fig. Three parties, Alice, Bob and 
Eve (who can be interpreted innocently as the “environment”, or maliciously as an “eavesdropper”) share a pure 


Alice Bob 



FIG. 3: Alice and Bob share only one copy of a state, but their measurements are chosen probabilistically. 
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Alice Bob 



FIG. 4: Alice, Bob and Eve share a pure state |4'). Alice chooses randomly between measurements Mx and Mz- Upon her 
choosing Mx, Bob makes the measurement My- If Alice chooses the measurement Mz, Eve performs Mw- 



FIG. 5: Alice, Bob and Eve share two copies of a pure state l^*). On the copy where Alice makes measurement {Ma^}, Bob 
makes the measurement {Mt,}. On the other copy, Alice performs the measurement {Ma^}, whereas Eve performs {Me}. 


state Such a scenario is considered, for example, in security proofs of cryptographic protocols |TJ[2ni where 

one must assume, to account for the worst case, that an eavesdropper has access to a purification of Alice and Bob’s 
shared system. Even within this tripartite scenarios, one can consider variations based on the way in which multiple 
measurements are incorporated. The setup in Fig.[^ for example, involves a shared random bit r that decides whether 
Alice will make measurement Mx or Mz- If she happens to choose Mx, then Bob makes a measurement My on his 
system; else. Eve measures her system through Mw- Once again, any measure of the conditional uncertainty of the 
CQ state resulting from the hybrid measurement amounts to a measure of the joint uncertainty of the two individual 
measurements, conditional on Bob’s and Eve’s systems, respectively. 

In the scenario of Eig. like in Fig. the three parties share two copies of the same pure state Ilk). Now the two 
alternate r cases of Fig. occur, but on different copies of the state. In the following section, as a demonstration 

of the application of the methods we have developed thus far, we will derive a universal uncertainty relation based on 
this tripartite scenario. 


III. APPLICATION I: UNIVERSAL, STATE-INDEPENDENT TRIPARTITE MEASUREMENT-BASED 

CONDITIONAL UNCERTAINTY RELATIONS 


We will now construct a universal conditional uncertainty relation for the scenario in Fig. Alice, Bob, and Eve 
initially share two copies of a pure state Ik = |dl) (Ikl, i.e., a state ^ i,^A 2 B 2 E 2 ^ Alice performs measurements 
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{Mai} {Maj} on her two copies. Bob performs {M;,} on his first copy and does nothing with his second. Eve does 
nothing with her first part, and measurement {Mg} on her second. Treating x = (oi, 02 ) as one variable, and w = (6, e) 
as another, we will apply our characterization of minimum classical conditional uncertainty (Section |II C[ ) on the joint 
CC state on XW to get a lower bound on the joint minimum classical uncertainty of Alice’s two measurements 
conditioned separately on Bob’s and Eve’s systems. We will assume that Alice’s measurements are both rank-1 
projective measurements. The post-measurement CC state Q on XW is given by 


Qxw — QaibQa2ei 


(41) 


with Qa^b = Tr (^M^/ (g) Mf ^ (g) ^ ^ ^ ^£2) ^A2B2B2j _ 

In order to lower-bound the joint conditional uncertainty of ai\b and 02 |e, we will lower-bound the conditional 
uncertainty of A'|IT, which in turn we will accomplish by constructing a “state-valued” upper bound on Q, under the 
‘Vc” partial order. That is, some G CC”, such that 




Q 


XW 


XB 


(42) 

gCC”, 


X C 

One possible upper bound is of the form Oxriv = |0) (0| (g) |0) (0| , which conditionally majorizes every cr 

let alone G CC”. But this bound would be trivial, because this O contains no conditional uncertainty. On the 

other hand, Q itself is another trivial upper bound: It is as tight as one could desire, but is state-dependent (in the 
strongest way possible), and as such, provides little insight. We would like to find some O that strikes a balance 
between these factors. 

The simplest kind of state that has some conditional uncertainty is some G CC ” suc h that ^ = |y| = 1 and 
O'! ^ (1,0,0..., 0)^. Let us try to hnd an upper bound O of this form. Erom Section 
the condition for Q in the case I = \-. 


IIA 


recall (18), which gives 


w q' 


\w 


(43) 


where u) is the lone column of VL. If we wish to make this vector state-independent, and dependent only on Alice’s 
measurements {Majj and {Mq 2 }, then it must majorize a Q that results from any dt and any choice of Bob’s and 
Eve’s measurements. We will now show that this results in the trivial upper bound flTriv, irrespective of Alice’s 
measurements. 

Recall our assumption that Alice’s measurements are rank-1 projective. Then, let |ai) (ai| be one of the POVM 
elements from her first measurements, and |q; 2 ) ( 02 ! one from her second. Now consider the tripartite pure state 

|'I^^'''^) = ^(l«i)|0)|l) + |a2)|l)|0)). (44) 


If Bob’s and Eve’s measurements are both just projective measurements with respect to their respective canonical 
bases, then the conditional distribution ql“'=(^>®)=(o.o) jg giye^ t)y 


^(ai,02)1(0,0) 


,ai ^02 ,02 ■ 


(45) 


Now considering (43), we see that the above conditional distribution forces fl to be flTriv So we have to look beyond 
the i = 1 case; let’s now try 1 = 2. Corollary assures us of the existence of such an 17 given a suitable vector a;. In 
the following, we will construct such a vector, to finally get our desired 17. 


Theorem 12. Define c = maxaj^^a 2 |(oi|q! 2 )|> where ai and 0:2 OLfe eigenvectors of Alice’s Ai and A 2 measurements, 
respectively. Then for arbitrary state ipABE o,nd measurements {Mb} and {M^}, we have 


with 


where 


Q 


(1 — a)tLii (1 — a)a;2 ... (1 — afojn 

a; = (^,...,/3,l-l/3,0...0) 


(46) 


(47) 


(48) 


with I being largest integer such that /3l < 1 and a,/? satisfy 

al3 = + 


(49) 
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Proof. We apply Lemma [E.2[ obtaining that 


y^g^maxg^l^, < -(1 + c)^. 


We then apply Lemma E.l with r = (1 + c)^/4. 
for 


(50) 


□ 


The relation (46) constitutes a nontrivial uncertainty relation, when a and /3 are strictly less than 1. This happens 

(51) 


^(1 + c)^ < a < 1. 
4 


Example: In the case of two qubit measurements and we get 

a (l-a)P 1 , n 

0 (l-a)(l-^) 


(52) 


IV. APPLICATION II: UNIVERSAL UNCERTAINTY RELATIONS BASED ON 
QUANTUM-CONDITIONAL MAJORIZATION 


Here we will construct another type of universal conditional uncertainty relation, this time for the scenario of Eig.j^ 
wherein Alice and Bob initially share two copies of a state, and then only Alice makes measurement 

Mx and Mz on her parts. Bob retaining his systems unmeasured. We provide a state-valued upper bound on the 
post-measurement state 

Consider first the case of a pure p = \il}){il)\. We will consider here orthogonal projective measurements of the form 
Mx = |sa;)(sa;| and = \tz){tz\ where {|sx)}"Cg and {\tz)}'fZo orthonormal bases of We write 

in its Schmidt form 


Then we have 


where 


IV') 


AB 


n—1 




B 




n—1 


'^Px\x){xf 

X — {) 







y=0 


n—1 

with px = Y. ^v\{sx\y)\‘^- 

y=o 


(53) 


(54) 


(55) 


Similarly, 


.J.ZB2 


n—1 


Y<lz\z){zf 0 \(pz)^H‘Pz\, 

z^O 


(56) 


where 


- n—1 n—1 

V y=0 y=0 

Hence, from Corollary the following universal uncertainty relation follows: 

Theorem 13 . For any choice of 4 indices {xi,zi) ^ (2:2,2^2), it holds for all lu G [Q^Px^qzi] that 

6 ?l j QXZB1B2 

(J 09“ “a:iX2 2i22 ’ 


(57) 


(58) 
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where = w|0)(0|^^ (g) \xiZi){xiZi\^^^^ + (1 - w)|l)(l|^^ ® , with 


l^'.T 


.H) 


B 1 B 2 _ 


\{(l)xA(t>X2){‘fzA‘Pz2)\ VP^2 qZ2\Xl)^"\Zl)^^ 


•\/l — UJ L 

+ VPxiqz^ + (1 - \{(l}xi\(l>X2){V>Zi\y:’z2)\'^)Px2qZ2 - w|a;2)^H^2)^" 
+ Vp=^^z\x)^"\z)^^ 


(59) 


(X,Z)^{(X1,Z1},(X2,Z2)} 

We can rephrase this universal uncertainty relation in terms of measures, as 


Theorem 

Then, 
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(Alternate form). Let Uxz\BxB 2 ^6 ® measure of conditional uncertainty as per Definition 
ldxz\B^B 2 {(x^^^ > , max max U = r]{Mx, Mz,'if). (60) 

(xi,zi)^(x2,Z2) ) 


This relation can also be extended to mixed bipartite states, if we restrict to jointly-concave conditional uncertainty 
measures, i.e., if 


1^xz\b,B2 > 


'^T'urw Uxz\b^B2 ® 


(61) 


k,k' 


where 


^XB, ^ 


XBi 


and 


rXB2 


= E 


r.rr^- 


Consider a mixed bipartite state 


pAB = ^rfc|'0fc)(-!/'fc| 


AB 


(62) 


Let be the CQ state obtained by measurement = {|sa;)(s 2 ,|} on and be the CQ state 

obtained by measurement = {|tz)(tz|} on The corresponding CQ states obtained by measuring 

and , respectively, on are then and Therefore, 


,XBi ^ ^ZB2 


^zb2 ^ 'Y Tf^rk' erf g) rf 


(63) 


k.k' 


is a convex mixture of “conditionally pure” CQ states. It LIxz\BiB 2 is concave, it follows that (60) holds for any p. 

Corollary can be used in other scenarios, as well, to yield fully quantum conditional uncertainty relations. For 
example, we can apply the same corollary to the tripartite tensor-product scenario of Section |III[ without Bob and 
Eve making measurements. 


V. DISCUSSION AND CONCLUSION 

We developed an operational framework that defines conditional uncertainty—that is, uncertainty of a classical 
variable (a measurement outcome) conditioned on access to a quantum memory—through a mathematical relation 
called conditional majorization. This relation, while a natural generalization of majorization, is much harder to 
characterize. The relation, denoted ‘Vc” j is defined between classical-quantum (CQ) states. A CQ state is more 
conditionally uncertain than another, , if and only if >-c ■ 

We found several useful results about conditional majorization in the classical-memory case. These include an 
efhciently-computable linear program to verify classical-conditioned majorization, a class of measures that provide 
necessary and sufficient conditions for the relation, and much simpler conditions in some special cases. In the quantum- 
memory case, we proved a sufficient condition for conditional majorization. We also found a method of constructing 
a tight state-valued upper bound (under the ‘Vc” relation) to a given state . 

We then developed a toolkit that can be used to construct measures of conditional uncertainty. Measures of 
quantum-conditional uncertainty (which are defined on CQ states can be constructed from measures of classical- 
conditional uncertainty (defined on CC states by applying the latter to probability distributions obtained by 
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measuring the memory system (_B), and then optimizing over this measurement. We called such measures “measures 
of the minimum classical-conditioned uncertainty”. In addition, contractive metric or pseudo-metric functions on the 
space of density matrices can be used to define conditional uncertainty measures. These measures are in the spirit of 
“least distance to the set of maximally conditionally-uncertain states”, which are states of the form 0 tr®. 

We then extended the idea of measures, to quantify the joint conditional uncertainty of two or more measurements. 
After defining the most general notion of joint conditional uncertainty measures, we considered some special opera¬ 
tional scenarios that naturally represent some aspect of joint conditional uncertainty. These include a few bipartite 
and tripartite operational schemes (Figs. [||||4]and[|). 

We used these operational schemes in two illustrative applications. The first one applied our characterization of 
minimum classical-conditioned uncertainty to a three-player game (Fig. ™ where Alice, Bob, and Eve initially share 
two copies of a pure tripartite state, ^ ^-4252^2_ Subsequently, Alice makes measurements {Ma^} and {Ma^} 

on her copies, while Bob and Eve each measure only one of their respective copies (with measurements {Mf,} and {Me}, 
respectively), discarding the other. We derived a state-valued upper bound to the resulting post-measurement 4-partite 
classical-classical state, effectively lower-bounding the joint uncertainty of Alice’s two measurements conditioned on 
the systems owned by Bob and Eve, respectively. This universal conditional uncertainty relation is of the form 

® (64) 

The other application was to bipartite scenario (Fig. [^, where Alice and Bob share two copies of some pure state, 
(g, ^A 2 B 2 ^ Alice makes two different measurements on her copies, while Bob keeps his system as it is. We 
then found a state-valued upper bound on the resulting 4-partite CQ-CQ state, resulting in the following universal 
conditional uncertainty relation: 


XB 


ZB 




AB\ 


(65) 


An open problem that immediately suggests itself is regarding the state-dependent nature of the universal relation 
It is not hard to see that, in the present form of the relation, a state-independent bound would necessarily 
Just consider the initial state to be maximally entangled, in which case the conditional 


(60). 


have to be trivial: 

uncertainty is zero for any projective measurement on Alice’s part. On the other hand, Berta et aUs relation 


H{X\B)^ + H{Z\B)^>\og, 


1 


HiA\B), 


( 66 ) 


retains a non-trivial element even for the maximally entangled case, essentially by splitting this zero conditional 
uncertainty between the two terms on the right: The first (positive) term representing the incompatibility of the 
two measurements, and the second (negative) term embodying the initial correlation between A and B. It would 
be interesting if our universal relation could be made nontrivially state-independent using a similar “split” between 
measurement incompatibility and initial correlation. 

Note that the tripartite case does not suffer from the same problem, enabling us to formulate a state-independent 
universal relation in Section |III[ where we considered accessible information-like measures of conditional uncertainty. 
We expect a state-independent tripartite relation to be obtainable also for other kinds of measures, using Corollary 
or similar results. We leave this, too, for future work. 

We expect that the characterization of quantum-conditioned majorization simplihes in special cases, such as the case 
of pure initial states, qubit measurements, etc. These cases are left for future work, as well. Another potential future 
work is to find other ways of constructing measures of individual and joint conditional uncertainty. For example, one 
could take measures dehned on CQ states whose conditional quantum states are pure, and then extend these measures 
via the convex-roof extension. 

An important project to be undertaken is the application of our methods to concrete cryptographic tasks, such as 
key distribution and coin-tossing. Replacing the existing, entropy-based methods with majorization-based methods 
is likely to improve the analysis of the single-shot or finite-size case of such tasks, possibly beyond the improvement 
already afforded by the smooth Renyi entropy calculus. 

The scenarios we considered in our applications, and indeed, all the ones depicted in the figures, are but a handful of 
examples that we contrived to illustrate our methods. The framework of conditional uncertainty (and joint conditional 
uncertainty) that we have developed is very general and versatile, and can be used in many other scenarios. This 
opens up plenty of avenues for future investigation. 
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k k 

[31] An n-dimensional vector p is said to majorize another vector, q, denoted “p ^ q”, if for all k G {1, 2 ... n}, X] — X] 

i=i 

where p'^ is a vector consisting of the components of p arranged in nonincreasing order, and likewise, q^. 

[32] Slightly deviating from convention, we will use superscripts to specify the names of subsystems A, B, etc. 


Appendix A: Classical-conditional majorization 

Notation: P is the matrix whose components are the joint probabilities Pxy We use the notation py = 
for the marginal probability of y = y, and = Py^ {Pxy)x f®'' the conditional distribution of X given 

Y = y. Denote by the set of all n x £ row-stochastic matrices, and by the set of all n x £ matrices with 
non-negative entries. In particular, P G in our generic example, while Q € 

We are interested in the following condition for the conditional majorization relation P >-(. Q: 

Q = (Al) 

3 

where each is an n x n doubly stochastic matrix, and each is an £ x m matrix of non-negative entries, with 
row-stochastic. 

Remark. Arbitrarily reordering the elements of each column of P is a CCR: Just choose D^^'> = lij (permutations), 
and R^^^ = Cjj, i.e., a matrix with zeros everywhere except the (j, j) element, which is chosen to be 1. 

In componentwise form: 


i 

3 y=i 

Here (D^^^py)x is the a;**' component of the vector D^^^py, where Py is the column of P (we will similarly use Qu, 
for the columns of Q). To simplify this relation, we denote 


R 


(3) 


ty^ = Y,Rifl and 


(A3) 


^yw 


Note that is an n X n doubly stochastic matrix and T = [ty 

notations the relation in (A2) becomes 


is an £ X m row-stochastic matrix. With these 


y=i 


p)(v,w) 


(A4) 


Lemma A.l. P >-c Q if cind only if there exists a set of n x n doubly stochastic matrices (£m in number) 

and an i X m row-stochastic matrix T = \tyw\ such that the columns of P (denoted Py) are related to those of Q (ciw) 
through (A4). 


Proof. We already proved that P >-c Q implies the existence of and T with the desired properties, satisfying 

(A4). It remains to show that if such matrices exist, then there exist D^d'> and R^^i that satisfy (Al). Indeed, since 


are doubly stochastic we can write them as where H^-^^ are permutation matrices and 

El = 1- Hence, taking R^^'> to be the matrix whose components are tyj^Syw, and the permutation matrices 


H^-^^ completes the proof. 


□ 


Lemma A.2. Consider an nx i probability matrix P = [pi,..., p^] such that one of the columns, say pi, is a multiple 
of another column, say P 2 . That is, there exists non-negative real number X such that pi = Ap 2 . Then, 


= [(1-t A)P2,P3,...,] 


(A5) 


where P' is n x (£ — 1) probability matrix. 
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Proof. Let T = [ti, be the following £ x {£ — 1) row stochastic matrix 


T = 


1 0 ••• 0 

h-i 


where Ii-i is the (£ — 1) x (£ — 1) identity matrix. Note that PT = P' and therefore P' -<c P- Let S = [si, 
be the following {£ — 1) x £ row stochastic matrix 


5' = 


/ l+A 1 + A " 
0 0 1 


V 0 0 

Note that P'S = P and therefore P P'■ Hence, P ~c P'■ 


0 


V 


(A6) 

■, Sf_l] 

(A7) 

□ 


Note that the elements of each column of P can be arbitrarily reordered under the allowed transformations. Together 
with the above lemma, this allows us to define the following standard form: 

Definition ^ Let P = [pxy\ be an n x £ joint distribution matrix. Its standard form P^ = is an n x £' matrix, 
where £' < L To obtain P'1, we apply the following transformations on P: 

1. Reordering within columns: Arrange the elements of each column of P in non-increasing order. Since this is a 

reversible CCR, we can just assume WLOG that P has this form. That is, for all y = piy > P 2 y • • • > Pny 

2. Combining proportional columns: If two columns of P, say those corresponding to y and y' , are proportional 

to each other (i.e., = p^^ ), then we replace both by a single column {py + pyi)p^y. We do this until no two 

columns are proportional. The resulting matrix P contains £' < £ columns, each of the form pujpl'". 

3. Reordering the columns: Reorder the £' columns of P in non-increasing order of If there are ties, resolve 
them by ranking the tied columns in non-increasing order of their first component (piu;)- If there remain ties, 
we rank by non-increasing (pi^, +P 2 w), and so on. 


The resulting n x £' matrix is the final standard form P^. Lemma A.2 implies that P'^ ~c P- 

In the remainder we will assume that P and Q are given in this standard form. Since all the in (A4) are 


k 

^ ^ Qxu 


V fc = 1,..., n. 


(AS) 


doubly stochastic, we get that 

e k 

— 

x—1 y—^ x—l 

Note that by taking the sum over w on both sides, we get that the marginal distributions p = {J2yPxy)x and 
91 = ^xw)x satisfy p q. Denote by P and Q the matrices whose components are 

k k 

Pky — ^ ( Pxy and Qkw — ^ ( Qxw (A9) 


That is, 


where L is the following n x n matrix: 


P = LP and Q = LQ, 


(AIO) 


L = 


/I 0 0 
' 1 1 0 
1 1 I 


Vl 1 I 


0 

0 


1 / 


(All) 


We will now prove an important simplification due to the standard form, which we stated as Lemma[^in the main text: 

Lemma [H For P and Q in the standard form, P Pc Q if cind only if there exists a row-stochastic matrix T such that 

LQ < LPT, (A12) 


where the inequality is entrywise. 
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Proof. We have seen in Eq. (A8) that if Q P then there exists a row-stochastic matrix T such that Q < PT. 
Conversely, suppose there exists a row stochastic matrix T satisfying (A12). Denote A = PT and the components of 
A by 


^kw — 


^ ^ Pkytyw • 


y^l 


(A13) 


The condition Q < PT is equivalent to q^j where = {akw)k, and the symbol stands for weak 

majorization (i.e. instead of equality 9kw < Sfc=i o,kw)- It is known (see e.g. [30]) that q^, a™ if and only if 

there exists a non negative (entrywise) matrix and a doubly stochastic matrix such that q^, = and 

5 '(™) < £)(’") entrywise. Therefore, there exists doubly stochastic matrix such that q^, < In components 

it reads 


n n £ 

^ik Pkytyw — Qxw 

k=l k=l y=l 


(A14) 


However, since the components qxw a nd g(, ,„ both sums to one, the condition 0 < ^ 
Hence, this equation is equivalent to (A4) with = D^'^\ and from Lemma 

completes the proof. 


< 


A.l 


implies that qxw = q'xi 


> Qx 

it follows that Q -<c P- 


This 

□ 


Remark. Note that the proof of Lemma [l] also implies that Q P if and only if there exist m doubly stochastic 
matrices D^'^\ and a row stochastic matrix T such that 





^ ^ ^ iywV^y 1 ^ 

(A15) 

This is a simpler version of (A41 
matrix A, whose columns are a^, 

For any set of m doubly stochastic matrices V = {D^P, 
we define 

...,DP^}, and an n x m 


V[A] := [D(i)ai,...,DWa„]. 

(A16) 

With these notations 

QPcP ^ Q = V[PT] 

(A17) 


for some set of m doubly stochastic matrices P and a row stochastic matrix T. 
We now prove Theorem]^ of the main text: 


Theorem Let P, Q, R be three probability matrices. Then, 

Reflexivity: P <c P 

Transitivity: P <cQ and Q R P <c R 
Antisymmetry: P Pc Q and Q Pc P pf = Qf 

(A18) 


That is, Pc is a partial order with respect to the standard form. 


Proof. Reflexivity and transitivity of Pc follow directly from its definition in (Al). To prove antisymmetry, suppose 
P Pc Q and Q Pc P (be. Q '^c Hb and suppose Q and P are given in their standard form; that is, we assume 
P = Pf and Q = Q^. From Lemma we know that there exist stochastic matrices T and R such that LQ < LPT 
and LP < LQR. Combining these two inequalities gives 


LQ < LQRT and LP < LPTR. 


(A19) 


Since RT is a row stochastic matrix, the sum of the columns of LQ and LQRT are the same. Therefore, we must 
have LQ = LQRT, and using similar arguments we also get LP = LPTR. Since L is invertible, this in turn gives 

Q = QRT and P = PTR. (A20) 


We next prove, that RT and TR must be the identity matrices. 
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Lemma A.3. Let A be anmxm row stochastic matrix, and Q = is annxm matrix with non-negative components. 
Then, 


Q = QA => A = Im- 


(A21) 


The proof is by induction over m. For to = 1, Q is an n x 1 column matrix. Then, A is a 1 x 1 row stochastic 
matrix, i.e. the number 1. Therefore the lemma holds for to = 1. Next, suppose the lemma holds for all n x to 
non-negative matrices (in their standard form); i.e. if for some m x m row stochastic matrix 

A*^™\ then A^"*^ = /„. We need to show that the same holds for all n x {m 1) non-negative matrices (in 

their standard form). Indeed, let be an (to -I- 1) x (to -|- 1) row stochastic matrix, and denote 


Q(m+1) 


QW|q^+i) and A(™+ 1 ) 



(A22) 


where u, v G and u is the sum of the components of u. Note that while A^™^ above is non-negative it is not 
necessarily row stochastic. More precisely, the sum of the columns of A^™^ is e — v, where e = (1,...,!)^. Now, 
suppose that With the notations above this is equivalent to 

Ql’”) =Q('")A(™)+q^+iU^ 

q^+i = -h (1 - u)qm+i. (A23) 


The second equation is equivalent to uq^-i-i = Therefore, if m = 0 then v = 0 since has no zero columns. 

Clearly, in this case we also have u = 0 so that and therefore A^™) = from the assumption of 

the induction. This also gives A^™ + 1 ) =/™+i. We therefore assume now that u > 0 and therefore v 7 ^ 0. Substituting 
qm-i-i = into the first equation of ( A23[) gives 


g(m) = Q(m) 


(A24) 


Since the snm of the columns of is 1 — v we get that A*^™^ -|- -vu^ is an to x to row stochastic matrix, and 
therefore from the assumption of the induction we get 

A(™) -h ^ vu^ = J^. (A25) 

u 

However, since u, v, A*^™) > 0 the components of u and v must satisfy 

UiVj = 0 for i 7 ^ j. (A26) 

Since both u 7 ^ 0 and v 7 ^ 0, there must exists io such that Ui„ 7 ^ 0, Vig 7 ^ 0, and Uj = Vj = 0 for all j 7 ^ io. However, 
in this case. 


Q.m+1 — ^ V — Qio • 

u 


(A27) 


That is, qm+i is a multiple of anther column of contrary to the assumption that is given in its standard 

form. Therefore, we must have u = 0, and as discussed above, this corresponds to = Im+i- This completes 

the proof of the lemma. 

Now, using this in the relation (A20l gives that both RT = and TR = I^. Hence, m = £ and R = T~^. But 
since both R and T are row stochastic, we must have that they are permutation matrices. Finally, since Q and P are 
given in their standard form, the permutation matrices R and T must be the identity matrices. This completes the 
proof of antisymmetry of -<c. □ 


For any P G 


denote by 


£(P, k) := {Q' G : Q' P) , (A28) 

which we call the Markotop of P. Note that the Markotop of P is a compact convex set. Its vertices are the set of all 
matrices of the form I?[PT] with V consist of to permutation matrices and T is a matrix whose rows are elements of 
the standard basis in K™. 
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Lemma A. 4. Given an n x i matrix P and an n x m matrix Q then 

^ E{Q,k)CE{P,k), y k^n. (A29) 

Proof. Suppose Q ~<c P and let A S E{Q,k). Then, by definition, A Q and Theorem gives A -<c P; that is, 
A G E{P, k). Conversely, Q G E{Q, m) C £(P, m) implies Q P- □ 

Let S£(pm) ■ —>■ M be the support function of the Markotop of P defined by 

5£(p,m)(^) := max {Tr(A'^(5) : Q G E{P,m)} (A30) 

for any A G Support functions of non-empty compact convex sets has the following property: 

E{Q,m) C E{P,m) (A31) 

From the previous lemma the support function provides a characterization of conditional majorization. We therefore 
calculate 


n m m £ n n 

Tr(A^Q) = EE ^kwQkw — EE iyw EE D^^JokwPxy (A32) 

k—1 w—1 w—1y—1 x—1k—1 

Since the set of doubly stochastic matrices is the convex hull of the permutation matrices we get 

71 n n 

max EE ^kx ^kwPxy — max ^ 7 z{x)wPxy — rrmx(na^) Py (A33) 

x—l k—1 x—1 

where the second maximum is over all permutations tt, and the last maximum is over all n x n permutation matrices 
n. Therefore, 


where 


t 

Se{p,m){A) = maxTr(A^X>[Pr]) “ max max(nau;)^py 


t 

Epy^A(p'^) 


= max max(nau,)^p'^ = 

w<.m n 



(A34) 


(A35) 


Note that is positively homogeneous (i.e. A$yi(pl^) = $^(Apl^) for A > 0), convex, and symmetric (under 
permutations of p^^). This leads to our main result: 

Theorem 1^ Let P and Q be nx£ and nxm joint probability matrices. Then, the following conditions are equivalent: 

1. Q<cP- 

2. For all matrices A = [ai, ...,am] G 


y—1 w—1 


where $a(p'^) is as in (A35). 

3. For all convex symmetric functions 




(A36) 


(A37) 


Remark. The matrix A can be assumed to be in standard form. 
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Proof. It remains to be shown that (3) follows from (1). Indeed, suppose Q -<c P- Then, there exist family T) oi m 
n X n doubly stochastic matrices and an i x m stochastic matrix T = [ti, such that 

Q = V[PT] = [D^^'>Pti, ..., (A38) 

Denoting by qw = X]”=i therefore get that 

ql’" = _L£)(“)pt™ = y" (A39) 

Qw Qw 

Note that the RHS of the above expression is a convex combination of since = Y^y=i tywPy We therefore 

have 

m m/t \ m i mi i 

y] ^ ^w,d> (DWpi^) < ^ ^(pi«) = Ypv"^ (p'") 

111=1 111=1 \y=l / 111=1 y—l 111=1 y —1 y —^ 

(A40) 

where the first inequality follows from the convexity of and the second from its Schur convexity. □ 


Insights from linear programming 


Theorem provides a characterization of conditional majorization in terms of the functions defined in (A36|. It 
is, however, possible to obtain a number of conditions using standard methods of linear programming. Consider the 
condition given in (A12) for conditional majorization, and let 


t = 


t 2 


nmi 


(A41) 


\tm / 

where ti, ...,1^ are the columns of T. Let T be the {nm + £) x £m real matrix 


f-LP 


r = 


-LP 


V h 


\ 


-LP 

h ) 


(A42) 


where It is the ix I identity matrix. Finally, let 


/-■Z-qi \ 


Tqm 

V e / 




(A43) 


where qi,...,qm are the columns of Q, and e = (1,...,!)^ G K^. With these notations we have the following 
proposition. 

Proposition A.5. With the same notations as above, Q <c P if o.iT'd only if there exists 0 < t G such that 


rt < b. 


(A44) 


Proof. The proof follows from Lemma[^ with the observation that if there exists a matrix T with non-negative entries 
that satisfies (A12) with ti. < e, then there also exists a matrix T' with non-negative entries that satisfies 

Eq. (|Al^ and XXi t'fc = e. □ 
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The proposition above has a dual formulation using one of the variants of Farkas Lemma [see for example p.91 in 
J. Matousek and B. Gartner, ’’Understanding and Using Linear Programming”]: 


Lemma A.6. (Farkas) Let r,b as above, then Ft < b has non-negative solution if and only if every non-negative 
Denote by 


s G g^p > Q satisfies s^b > 0. 


With these notations 


\ 

Sm 

y 


s^F > 0 


p(nm+^) 


with Si,Sm € and r € 


> sJLP V re = 1 ,m 
Vy > max s^Lpy Vy = l,...,£ 


where Vj are the components of r. Similarly, 


s^b > 0 r^e>^sjLqu,. 




(A45) 


(A46) 


(A47) 


Note that if (A47) holds for r with Vy = maxu,g{i_..._m} s(,Lpy, then it also holds for r with Vy > max„g{i_,,._m} s^Lp^. 
Therefore, defining the matrix S = [si, G we get that Q -P if and only if for all S G 


W max s^Lpy > Tt{S'^LQ). 


y^l 

Denoting by A = S^L the above equation becomes 

£ m 

V max a^pj^ > Tr(AQ) = V a^q„ 

—■ ujG{l,...,m} —' 

y—1 w—1 

Since a^, = a^ and Py = pf the equation above is equivalent to 


(A48) 


(A49) 


Note that also 


y—1 




W—1 


W — 1 


(A50) 


(A51) 


Therefore, the condition in Eq. (A36) is stronger than one given in Eq.(A50), and yet Eq. (A50) is sufficient for 
conditional majorization. 

Several special cases of n, i and m were considered in the main text. The case £ = 2 is the most involved of 
these special cases, and so here we look at this case in detail. We assume that Q = and P = are 2-row joint 
probability matrices. 

Eor £ = 2, Q P if and only if there exists two m-dimensional probability vectors a and b such that LQ < LPT, 
where 


T = 




(A52) 
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We denote by = Yllc=i for w = 1, m, and similarly by py = X]”=i P^y for 2 / = 1, 2. Since pi + p 2 = 1 we 
simplify the notation and denote pi = p and p 2 = 1— p. With these notations the inequality LQ < LPT is equivalent 
to 


( 7 u,Lql“'< + 6u,(l — V ru = 1,m. (A53) 

Note that the last component in the vector inequality above reads 

qw < a^P + - p) Vw = l,...,TO. 


This is possible only if 


q-w = a^p + buiil -p) 


for all w = Substituting this into (A53) we conclude that Q 

vector a satisfying the following conditions: 


(A54) 

P if and only if there exists a probability 


, q-w 

Ojqn 


«„L(p|i-p|2)>^L(ql“-p|2) 


P 

We now prove the following theorem, stated in the main text: 


(A55) 

(A56) 


Theorem Let P and Q be n x 2 and n x m probability matrices given in their standard form. Let p and he as 
defined above. Define 


Pk = ^ {Px\i - Px\2) and ixf’ = ^ {qx\w - Px\2) ■ 


(A57) 


X — 1 


X — 1 


Denote byl~^, , andl the set of indices {A:} for which pk is positive, zero, and negative, respectively. Furthermore, 
define: 


_ qw I r, 

a,„ = — max < 0, max 




P 


kei+ pk 


; fiyj = — min -I 1, min 


{w) 


p 


and through these. 


W+{P,Q) = l-Y,<^n,-, W.{P,Q)=(J2 


k^X jl}- 


- 1 ; 


Wq{P,Q) = - max Wi{P,Q) = min (/3„ - a^,). 

Then, Q <c P if and only ifWo, Wi, W.^., and W_ are all non-negative. 


(A58) 

(A59) 

(A60) 


Proof. To prove the theorem we need to show the following equivalence: (A55|-(A56) (A59)-(A60|. To prove the 

implication we assume (A55)-(A56) and express the condition (A56), using notation introduced in (A57), as 

(A61) 


awPk > V k,'iw. 


Next, we consider the three partitions of the set {k} that together cover the whole set: 

1. k G be., pk = 0. 

Then for all w = 1,..., m. 


< 0 V (fc G J°) ^ ^ max^'") < 0. 

p p kei° 


Since ^ is positive for all w, we get that Wo{P, Q) = 


max^.fcgxoji^fc I 


is non-negative. 


(A62) 
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2. fc S ,i.e., /ifc > 0. 
Then for all w = 1,..., m 


(Xyj 


{w) 

^^>0 Vfcex+. 
V Mfc 


But since > 0 must also hold, we have 


(A63) 


o-w — o-w > 0. (A64) 

Summing over w and requiring that 1T-|- = 1 — must be non-negative. 

3. fc G X“, z.e., /ifc < 0. 

Then for all w = 1,..., m 


(w) 

Qw Vk 
P Pk 


— Ou, > 0 Vfc G X . 


But (A55) states that au, < qw/p must hold. Therefore, 

i^W ^ 0- 

Again, summing over w and requiring J2w=i ^ entails that W- = /3tu ~ 1 be non-negative. 


(A65) 


(A66) 


Finally, combining (A64) with (A66l for all w, we obtain the condition VFi = min„g{i_ (/?„, — a^,) > 0. This 
completes the proof of one direction. 

To prove the other direction, define the matrix-valued function 0 : —>■ given by 


0(v) = 


Vl 

qi-pvi 

. i-P 


V2 

q2-pv2 

1-p 


'^m 

qm—pVn 

1-p 


(A67) 


Each such matrix satisfies all the properties desired of the matrix T in (A52), except possibly row stochasticity. Now, 


the first-row sum is a continuous function of v, monotonically increasing in each Vw Moreover, by construction, 

if the first row adds up to 1, so does the second row. Let a = (ai... a^), and similarly define (3. By the premise, 
0(a) and 0(/3) contain only nonnegative entries. Furthermore, = 1 — < 1 and J2w = W- -I- 1 > 1, 

and j3w > Qftu for all w (since Wi > 0). The intermediate value theorem then ensures the existence of some v such 
that 0(v) is a row-stochastic T with the desired properties. □ 


Appendix B: Quantum-conditional majorization 

Let = J22=o ^x\x){x\^ be a classical-quantum state where {cr^} are n normalized states acting on Bob’s 
Hilbert space Ti.^ with dim’H'® = d. Each can be written as: 

d— 1 d —1 

^ ^y\x\y T {y 1 ^ ^ Qy\x\'^y\x) ('^y|a:| 

y^o y^O 

where {\x;y)}yZl are the eigenvectors of that corresponds to the eigenvalues Xy^^. The set of vectors {\uy\x)^}yZl 
generate another possible decomposition of In general, the set {\uy\x)^}yZ^ is not orthonormal and in fact, in 
special cases it is even possible to have \uy\x) = \uy'\x) for y ^ y'. 

Lemma Let be a density matrix and denote by = (Ao|a;,..., A„_i|a;) its vector of eigenvalues. Then, there 
exists states such that trf = if and only if 


(B2) 


where ql"’ = (gol®, 9n-i|a:)- 


ql’" ^ AI“, 
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Let be another classical-quantum state given by 

= a;|0)(0|^ 0 |0)(0|^ + (1 - a;)|l)(l|^ ® |V')(V’I^ (B3) 

where is some fixed state in Bob’s Hilbert space, T-L^. We also completes |0)® (i.e. the state appears in to 

an orthonormal basis which we denote by {\y)^}yZo for all y = 0,d — 1. In the theorem below assume that d = n, 
since if d < n we can always embed in C", and if n < d we can always add terms to with zero probabilities. 
We will therefore use the same index n for both d and n. 

Theorem Let and be classical quantum states as above, and for each let {qy\x, \uy\x)}^fQ one of its 
many decompositions. Denote by Cy = {y\tj}) the y component of\tjj) when written in the basis {|?/)^} discussed above. 
Finally, for any j, k G {0 ,n — 1}, with j k, denote by 


’’’v' ^ — %\k\{Uy\j\Uy\k)\ 


n — 1 


^O.fe) ^ '^qy\k\{Uy\j\Uy\k)? 
v=0 


Ojk 




{y\ 


mm 

..(rk) 


> 0 } 


%\j 1 

D3,k) f 

'v ) 


(B4) 


Then, -<c if oj < and there exists j,k G {0, ...,n — 1}, with j k, such that the following condition 

hold: 


W = |4'jfe(w)) = ^ + \/l - w - |1)^^ 


(B5) 


Proof. Without loss of generality, take j = 0 and k = 1. Let Af be a CQO. Then, 

n—1 n—1 

j,x—0 j,x—0 

where is a CPTP map. We now consider a specific CQO for which D^J = Sqx, D^J = di^, D^J = Sqx, 

and for j > 1, D^J = 6xj. Furthermore, we will only consider trace decreasing CP maps such that £*^fi(|0)(0|) is 
zero unless j = 0. Therefore, for these choices we get: 

= a;|0)(0|^ (|0)-®(0|) 

+ (i-cc)|0)(0|^o£:«(|V')^(V>|) 

+ (i-o;)|i)(i|^o£:(°)(|v^)^(V^|) 

n— 1 

+ (l-o;)^|x)(x|^0£(")(|V^)^(V'|) (B7) 

x—2 

Consider now the following choices of . Each is given by 

n — 1 

£^^\-) = T.^y\xi-)K\^ (B8) 

v=0 

where the Kraus operators Ky^x have the following form: 

n—1 

"^y|0 \/^l^y|o) (01 F ^ ( Oyz {z\, Ky^i ^^l'^y\'0'y\o) (1| and for X > 1 ^y\x (^1 (B9) 

Z = 1 

where Sy,ty are probabilities, for each y the n—1 vectors {\vz\y)]:f:Zi are orthonormal and perpendicular to |ny|o)j 
and the non-negative numbers bx and the complex numbers Oyz satisfy the relation: 

n—1 

da; + |aya;p = 1 forall x = 1,..., n — 1. 

y=o 


(BIO) 
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Note that the above relations ensure that J2y x^o ^l\x^v\^ ~ therefore get so far that: 

A/'(f2^®) = [w + (1 - a;)6i|(l|i/’)P] |0)(0|^ (g) cr® 

n—1 

+ {1 - uj)'^b^\{x\tp)\'^ \x){x\^ <S) a. 

x^2 

where we have chosen the probabilities ty and Sy such that 


B 


%\o 


ujty + (1 - u;)bi\{l\ip)\'^Sy 


w + (1 - a;)5i|(l|V')p 

It is therefore our goal to get {tp\) proportional to erf. Denote 

n—1 




2 = 0 


Then 


^y\o\'^) ^0\/^|'^y|o) T E Clyz Cz\Vz\y) 


z=l 


We would like it'j,|o|r/’) to be proportional to the state |My|i), and more precisely, we want 

Ky\o\ip) = 9^f^\uy\i) for some 5 e C. 

We therefore write each in the basis 

Wv\i) = v^l«y|o) + 


(Bll) 


(B12) 


(B13) 


(B14) 


(B15) 


(B16) 


where Ij/-*-) is some normalized state orthogonal to |wy|o), and Vy S [0,1], 9y S [0, 27r]. Note that the set of states {Ij/'*")} 
are completely determined by {|uy|i)}. We choose |ni|j,) = |y''). For this choice we then must choose ayz = ay5zi- 
For these choices we then have 


Ky\o\i^) = coV^|wy|o) + ayCi\y^) 

9V%\^V^ = CovE and - Xy = QyCi 

iing the 
Co 


Hence, from (B15) we have 


Note that we can always take ay = e*®|ay| to absorb the phase, and by taking the ratios we get 


9\/9y\ly/^ Co-\/ty 


|C-ly|Cl 

\/l^ 


The first equation gives: 


ty — 


9y\l^V 


E n—1 

z=o 9z\irz 


9^ 


n — 1 

= E9-I 

z=0 


irz 


For this value of ty, we must have Sy > 0. This leads to the first constraint on w, qy\Q, 9y|ii and Tz 


w yqyio 

The second constraint comes from 


E n— 1 

z=o <lz\irz 


+ 9y|o(l ~ w)^l|(l|'*/')l ^0 


Ty Cl 


(B17) 

(B18) 

(B19) 

(B20) 

(B21) 

(B22) 
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Therefore, the sum over y gives 


'1 2 
,2 9 


"•-1 \ / ^2 

9 1. 0 ( 


y=o "=1 V y=o J \ 9 

We therefore get the constraints on the coefficients cq, ci, and g: 


9 Cq 


^ z=0 


With these choices we get 


(B23) 


(B24) 


= [oj + {1 - u)h\{l\^)f] 

+ (l-a.)g 2 |l)(l|^ 0 af 

n—1 

+ (l-w)^ |(x|V')na;)(xr-(»cr^ 

a ;^2 

where we substitute = 1. Therefore, 

go = w + (1 - a;)5ici gi = (1 - w)g^ Qj; = {1 - oj)\cx\'^ {x > 2) 

Now, 


n—1 

6 l = 1 - ^ lOyP = 1 - 

y=o 


2 _ 2 ,.2 I ^2 _ 2 

9 Cq _ Cq -h C]^ g 


Hence, 


To summarize: 


go = w + (1 - w) (co + c? - g^) 

go + gi = w + (1 - w) (cq + c\) 
gi = (1 - a;)g2 

gj, = (1 - a;)|ca;p for a; = 2 ,...,n - 1 

n —1 

^0 ~ 9 ^ ^ 9y\lXy ; Cy = I (My|o|My|i) I 


2=0 

2 ^ 2 , 2 
5 < Co + Cl 


9y|0 


gi/|iCj/ 


+ <Zy|o(l ~ ^)^l|(l|V')P ^ 0 


(B25) 

(B26) 

(B27) 

(B28) 

(B29) 

(B30) 

(B31) 

(B32) 

(B33) 

(B34) 


E n— 1 

2=0 9z\ir. 

After some simple algebra, these conditions can be shown to be equivalent to the condition of the theorem. □ 

Corollary Using notations as in Theorem^ if all = \4>x)^{(t>x\ then for any j G {0, 1} and to G [0, g^] 


CO 




|0)(0|« + (1 - a;)|l)(l|^ 0 (B35) 


where 


and 


vi -< 


(cj 10 )-® + 11 )-®) 


Cj = max|((/)i|(/ifc)|v^ 


(B36) 

(B37) 
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Appendix C: Certainty accessible via Bob’s measurements 

We consider here the case of a classical memory Y that results from Bob measuring a quantum B. The essential 
idea is to take the best possible measurement on B and then use the previous section’s methods on the resulting CC 
state. To this end, we define the following. 

Definition Let (TxB G CQ" result from the measurement Mx on system A of pab- The minimal conditional 
uncertainty associated with the measurement Mx given accessibility to the quantum system is a function Mu : 
C Q" —M defined by 


Mu (ctxb) := mill Ux\y (P) , 

{My} 


(Cl) 


where the minimization is over all POVMs {My} describing Bob’s measurement, where P G CC" results from axB 
through a particular choice of this measurement. Ux\y is a classical measure of conditional uncertainty as in Defini¬ 
tion [5l 

We now prove a lemma in this connection, which was stated in the main text. 

Lemma 


10 


It is sufficient to take the minimization in (Cl) over all rank 1 POVM measurements. 


Proof. To prove the statement we need to show that 


min Ux\y{P) ^ min Ux\y{P'), 

{My} {r„,} 


(C2) 


where {My} stand for all rank-1 POVM measurements, whereas {fy'} for arbitrary POVMs. To this end, we can 
define general POVM elements as a linear combination of rank-1 POVM elements Tyi = Pyy'My, where Ryyi are 
components of row-stochastic matrix R {J2y' Pyy' — !)• Then the components p'y.y, of matrix P' are given by 


xy' 


= Tr 


PAb{M^ ® fy') = ^ Ryy' W [PAb{AI^ ® My)] = ^ Ryy'Pxy, 


(C3) 


where in the second equality we used linearity of the trace. Now, since p^y are the components of the matrix P, we 
have that P' = PR. From the property (1) of a measure of the uncertainty of X\Y (see Definitionj^ we have that it 
is nondecreasing under processing of Y: 


UxlY{P)<i^X\YiPR), (C4) 

and after minimizing over POVM measurements we get ( |C2[ ). □ 

Lemma C.l. Mu is monotonically non-decreasing under QCR. 

Proof. To prove the statement we need to show that 

min Wx|y(P) < min Z^x|y(Q), (C5) 

{My} 


where P (Q) is a matrix of probabilities Pxy idx'y') obtained by measuring POVM elements My (Py') on a state axB 
{N{(jxb))- 

First we will express the probability Pxy in the product form 

n 

Pxy = Yr(^px'\x'){x’\® ax'){\x){x\® My) 

x' — l 

= Yvpx\x){x\® {axMy) 

= PxTraxMy 

PxPy\x' 


(C6) 
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Let us now take the POVMs Ty/ 
from 


that gives the minimum in the RHS of (C5). The probabilities qx'' 


are calculated 


Qx'y' 


TTjV{axB)i\x'){x'\ (g) fj^') 

n 

x,x" — l j 
n 

Tr^ ^ (g) 

X = 1 j 

n 

j X = 1 


(C7) 


We will now show that for any POVM measurements {fy} and trace preserving operation '^jK]{-)Kj there exist 
POVM measurements {My} and a set of matrices with non-negative entries with the property that jg 

row stochastic, such that 

K]yy'Ky=YRwMv, (C8) 

y 


where R^yy, are the components of the matrix To show this dehne the POVM elements My and the matrices 

as the following 


My 

= M(^z,k) ■- 

= Klt.Kk, 

dO) 

^yy' 

— p C J) 

- ^{z,k)y' 

■— ^zy'^jk- 


Note that such defined My is a valid POVM: 


while J2j is row stochastic: 


Having the above we can write 


hence for such defined My 


YMy = Y Klr.Kk = E 

y z,k k 


E E = E E = 1, vy. 


TT{axK]fy,Ky) = 




My), 


n I 

^^'y' = E E E 

j y^l 

n I 

= E E E ^x'lP-Pyl^^yy' > 

j X=1y=l 


(09) 

(CIO) 

(Oil) 

( 012 ) 

(013) 

(014) 


which means that the matrix Q can be expressed as Q = D^^'>PR^^\ Recall that the measure of the uncertainty of 

V|V is non-decreasing under correlated classical processing of Y with random relabelling of X and since we assumed 
that Py' were optimal we get 

Ux\Y{P)<MinUx\Y[Q)- (CIS) 

{r„'} 

Therefore, minimizing Ux\y{P) over all POVMs completes the proof. □ 
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Appendix D: Joint conditional uncertainty 


Since the uncertainty principle is about more than one potential (actual or counterfactual) measurement on Alice’s 
side, we require a notion not of the conditional uncertainty of a single measurement, but of the joint conditional 
uncertainty of two or more measurements. We would like first to define the most general notion of joint conditional 
uncertainty that is independent on whether the two measurements on Alice’s system are actual or counterfactual, 
and is not restricted to a particular scenario or task. We therefore only require from such a measure to be monotonic 
under joint QCR: 

Definition!^ ^ : CQ" x CQ^ —?> K, and let axB € CQ^ and jzc S CQ"*. Then, J \s & measure of the joint 
conditional uncertainty if: 

(1) J [(JxBtIzc) = 0 if both axB and 72 c are product states of the form |0)(0| ® p. 

(2) J {(^XB,Jzc) > J {d'xB',lzc') if both axB dxs' and 'jzc -<c Izc- 

One can construct measures of joint conditional uncertainty from classical measures. This can be done as follows. 
Let : CC" x CC™ —)■ M be a measure of classical joint uncertainty (i.e. satisfying the conditions of Def. for 
classical states /distributions P and Q). Then, 


11 


Theorem 

be two POVMs, 


Let (JxB 
and let 


CQL" and jzc G 
= (Pxy) and Q 


CQ"^ be two classical-quantum states, let {My '} and 
= (Qwz) be two probability matrices with probabilities Pxy = 


{M^} 


Tr 


crxB (\x){x\ andqzw=Pr "fzc {\z){z\® Mw'^\ . Then, the function 


J {oxB.lzc) ■■= min ^(P,Q) 

M(0),M(i) 


(Dl) 


is a measure of conditional joint uncertainty. The proof follows from the same idea as in Lemma |C.1| 


Appendix E: Miscellaneous auxiliary results for universal tripartite relation 

Here we include a couple of sundry lemmas that are useful in proving our universal tripartite minimum classical 
uncertainty relation in Section |III| First, we state without proof the following elementary application of Markov’s 
inequality: 

Lemma E.l. Let Q G CC". Suppose that 


Qw max qx\w<r (El) 

W 

For arbitrary /? > 0, define Wp = {w : maxqx\w < /3}. Then, 

<lw P ^ — -Q- (E2) 

Recall the scenario in Fig. Alice, Bob, and Eve initially share two copies of a pure state = Idt) ('I'|, i.e., a state 
\^AiBiEi (g) Alice performs measurements {Ma,} and {Ma^} on her two copies. Bob performs {Mf,} on his 

first copy and does nothing with his second. Eve does nothing with her first part, and measurement {Mgj on her 
second. We are interested in the following quantity, for a given pair of {Ma^} and 

T] = max max V] qtPe maxqapbqa 2 \e (E3) 

'I' {Afj,},{Me} 01,02 


Lemma E.2. p = |:[1 + c]^. 

Proof. Consider the following auxiliary quantities: 


flower — 


max max 

{Mb} ,{Me} 


^QbQe 

0 . 1 , 0.2 
b,e 


where the maximum is taken over all product states, and 

?7upper=max max V] V] 9e'|b9b'|e maxg^ |be'(?a 2 |h'e> 

I- {Mb},{M„}^ ^ ' ' 01,02 

6,e b',e' 


(E4) 


(E5) 
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where 


Qe'\b 

Tr [^'(7 (g) Mb (g) Mg/)] 

Tr[^'(J(gMb(g/)] ’ 

(E6) 

Qb'\e — 

Tr [^'(7 (g) Mb/ (g) Mg)] 

Tr[T(7(g7(gMg)] ’ 

(E7) 

Qai\be' 

Tr [^'(Ma, (g Mb (g) Mg/)] 

Tr [4'(7 (g Mb (g Mg/)] ’ 

(E8) 

Qa2\b'e 

Tr [^'(Ma 2 g Mb/ g Mg)] 

Tr [^'(7 g Mb' g Mg)] 

(E9) 


It is clear that ??iower < ?7, while since 




max le max y Qe'\bQb'\eQai\be'Qa 2 \b'e — 

01,02 01,02 

b',e' 


/ , b^b' e max Cla 2 b'e 

^ ' 01,02 
h' ,e' 


(ElO) 


we also have that Tyupper > V- We will show that ryiower = i[l + c]^ and also ryupper < + c]^, where c is maximum 

overlap of basis vectors defining {Ma^} and {Ma 2 }, and since ryiower < ?? < VnppeT we immediately get rj = |:[1 + c]^. 
Calculation o/ryiower- 
We have the following 


max inax ^ V gb<?e maxq„^|bga 2 |e 

{Mb},{Mg} ^ ^ 0'i^0'2 

b,e 

(Ell) 

max max q^Qe max qa 2 

{Mb},{Mg} 0 , 1 ,a2 

b,e 

(E12) 

max maxgaiQas max qt,qe 

0-1,0-2 {Mb},{Mg} 

b,e 

(E13) 

maxmaxgaiga2 

'ifjA ai,a2 

(E14) 

lil + c?, 

(E15) 


where in the first line we used probability independence on conditioning for product states, while in the last line we 
used the result from [5]. 

Calculation o/ryupper- 
We have the following 


Vnpper = Hiax max ^ ^gb'ye V'ge'|b9b'|emaxqapbe/ga2|b'e 

b,e b',e' 

= max rnax V gbe'^b'e max(y„^|bg,g<j 2 |b'e- 

^ {Mb},{Me} Q'1-,Q'2 

b,e,b' ,e' 


Notice, that we can use the pairs of indices {be') and (b'e) interchangeably: 

max niax 'V' g^'^b'e maxgi^ibe/gQ^lb'e = max inax 'V] qveQbe'qa’ \b'eqa'\be 

'I' {Mb},{Me} 0,1 .rto >]/ SM,\SA/r\ n' n' 


^ {Mb},{Me}, 


b,e,b',e' 6,e,6',e' ^ ^ 

and we can symmetrize the maxima over (ai, 02 ) (for any given set (6, e, b', e')) by writing the following 


??uDDer = max max 


V qbe'qb'e^ ( maxq„^|hg/gg2|b'e + maxgaMf,,gg„/|bg/ 
b,e,b',e' ^ ^ 


(E16) 

(E17) 

(E18) 

(E19) 

(E20) 


The main advantage of doing so is that in comparison with maxai,a 2 9 oi|b 9 o 2 |e m the definition of ry (E3), we no longer 
need to tackle the probabilities qax\b and qa 2 \e conditioned independently on Bob’s and Eve’s outcomes. Instead, we 
now only need to evaluate ^ (maxai,a 2 9oi|be'9a2|b'e + maxg/^ a^ 9o'j|b'e9oybe') for each setting {b'e, be'). In this case any 
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pair of settings (b'e) (and accordingly {be')) leads to preparing a particular state ip {(p) on Alices’s side. Thus, we 
need to find the maximum of 


= o niaxgr(ai|V’)(?(a2|((') + uiaxq{a[\(p)q{a2\'tp) ■ (E21) 

Z \0i,02 o'j,o^ / 

over all states ip and (p. Suppose then that the is attained for the vector corresponding to a particular al in 

the first maximum. The second term is then maximized by choosing the vector corresponding to a'^ for which the 
overlap |(a*|a 2 *)| is maximal with respect to other choices of 02 . This is guaranteed due to nonuniqueness of the state 
Ip for which the optimal probability q{al\ip) is drawn. W.l.o.g. we can set a* = a'p and 02 = a '2 , thus the four vectors 
corresponding to al,al,ip,(p become coplanar. Let us denote the following angles: 


then 


i>4 


o,ip 2 


Zial,a*2) = ae [0,f], 

(E22) 

Z{al,ip) = 0 e [0,a], 

(E23) 

= p & [0,a], 

(E24) 

(cos^ 0 008 ^ p + 008^(0 — a) cos^((/3 — a)). 

(E25) 


Notice that a function of the form cos^(0 — a) cos^{ip — a) for any given parameter a has a maximum for 0 — (p = 0, 
therefore 


max i (cos^ 0 cos^ p + cos^(0 — a) cos^(p — a)) = max ;^(cos^ 0 + cos^(0 — a)). 

9,(p 2 0 2, 

By differentiating the last expression with respect to 0 and equating to 0, we arrive at 

cos^ d sin 0 + cos^ (0 — a) sin(d — a) = 0. 


(E26) 


(E27) 


Since cosines are nonnegative, the only nontrivial solution is d — a = —0. Thus, for 0 = ^ we obtain 

max ;^(cos‘* 0 + cos^(0 — a)) = ;^(cos^ ^ + cos‘^(| — a)) = ^[1 + coso?]^. (E28) 

0 Z Z 4 

Recall that a is chosen such that from the definition it maximizes the overlap between the vectors corresponding to 
Oj and 02 , and so cos a = c. 

Notice that in fact we obtained < j[l + c]^ which immediately leads to conclusion ?7upper < q[l + c]^. □ 



